Zhen Xiang

Palette: A Modular, Controllable, and Efficient Framework for On-demand Authorized Safety Alignment Relaxation in LLMs

May 22, 2026
Via arXiv

Crafting Reversible SFT Behaviors in Large Language Models

May 07, 2026
Via arXiv

Green Shielding: A User-Centric Approach Towards Trustworthy AI

Apr 27, 2026
Via arXiv

ShieldNet: Network-Level Guardrails against Emerging Supply-Chain Injections in Agentic Systems

Apr 06, 2026
Via arXiv

Q-realign: Piggybacking Realignment on Quantization for Safe and Efficient LLM Deployment

Jan 13, 2026
Via arXiv

RadFabric: Agentic AI System with Reasoning Capability for Radiology

Jun 17, 2025
Via arXiv

CDR-Agent: Intelligent Selection and Execution of Clinical Decision Rules Using Large Language Model Agents

May 29, 2025
Via arXiv

SOSBENCH: Benchmarking Safety Alignment on Scientific Knowledge

May 27, 2025
Via arXiv

How Memory Management Impacts LLM Agents: An Empirical Study of Experience-Following Behavior

May 21, 2025
Via arXiv

Doxing via the Lens: Revealing Privacy Leakage in Image Geolocation for Agentic Multi-Modal Large Reasoning Model

Apr 29, 2025
Via arXiv